Extended maximum a posterior linear regression (EMAPLR) model adaptation for speech recognition
نویسندگان
چکیده
In this paper, a new approach for model adaptation, extended maximum a posterior linear regression (EMAPLR), is described and studied. EMAPLR is an extension of maximum a posterior linear regression (MAPLR) for transform based model adaptation. The proposed approach has a close form solution under the elliptic symmetric matrix variate priors, and it is effective in our speech recognition experiments. EMAPLR is based on a direct MAPLR solution of the transform imageW s without explicitly solving the transformation matrix W . This is fundamentally different from conventional MAPLR and MLLR. Moreover, the proposed EMAPLR approach is incorporated with the structured prior evolution which significantly improves the algorithm efficiency and robustness. The structure of prior evolution in MAPLR is studied and it is shown that under the structured prior evolution, the priors in MAPLR follows a recursive formulation. Experimental results on WSJ (Spoke 3) non-native speaker adaptation task indicates that significant gain over MLLR and MAPLR can be obtained with same amount of adaptation data.
منابع مشابه
Quasi-Bayes linear regression for sequential learning of hidden Markov models
This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent speech recognition system to handle the nonstationary environments via the linear regression adaptation of HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute the sequential a...
متن کاملOnline speaker adaptation based on quasi-Bayes linear regression
This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent (SI) speech recognizer to meet nonstationary environments via linear regression adaptation of SI HMM’s. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute online adaptation where t...
متن کاملDiscriminative adaptation for log-linear acoustic models
Log-linear models have recently been used in acoustic modeling for speech recognition systems. This has been motivated by competitive results compared to systems based on Gaussian models, and a more direct parametrisation of the posterior model. To competitively use log-linear models for speech recognition, important methods, such as speaker adaptation, have to be reformulated in a log-linear f...
متن کاملMaximum a Posterior Linear Regression Based Variance Adaptation of Continuous Density Hmms
In this paper, the theoretical framework of maximum a posterior linear regression (MAPLR) based variance adaptation for continuous density HMMs is described. In our approach, a class of informative prior distribution for MAPLR based variance adaptation is identified, from which the close form solution of MAPLR based variance adaptation is obtained under its EM formulation. Effects of the propos...
متن کاملMaximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers
Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech dat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000